
EDP Sciences OAI-PMH repository (1.2.0)

Archivio della Ricerca - Università di Salerno

ART

CERN Document Server

Open Access Repository

From Nonspecific DNA–Protein Encounter Complexes to the Prediction of DNA–Protein Interactions

Author: A Sarai
AV Morozov
BW Matthews
BW Matthews
CG Kalodimos
CH Yan
CO Pabo
E Fraenkel
E Katchalski-Katzir
FK Winkler
H Tjong
I Bonnet
IB Kuznetsov
Ilya Vakser
J Gorman
J Skolnick
J Skolnick
JE Donald
Jeffrey Skolnick
JJ Havranek
JS Lamoureux
JS Lamoureux
M Billeter
M Gao
M van Dijk
MJ Sippl
Mu Gao
N Bhardwaj
NC Horton
NM Luscombe
NP Stanford
O Givaty
P Aloy
P Rotkiewicz
PH von Hippel
R Mendez
R Samudrala
RMA Knegtel
S Ahmad
S Jones
SE Halford
SJ Hubbard
TA Robertson
TW Siggers
W Humphrey
WJ Lane
XJ Lu
Y Zhang
Y Zhang
ZJ Liu
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 01/01/2009

©2009 Gao, Skolnick. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
doi:10.1371/journal.pcbi.1000341
DNA–protein interactions are involved in many essential biological activities. Because there is no simple mapping code between DNA base pairs and protein amino acids, the prediction of DNA–protein interactions is a challenging problem. Here, we present a novel computational approach for predicting DNA-binding protein residues and DNA–protein interaction modes without knowing the protein's specific DNA target sequence. Given the structure of a DNA-binding protein, the method first generates an ensemble of complex structures obtained by rigid-body docking with a nonspecific canonical B-DNA. Representative models are subsequently selected through clustering and ranking by their DNA–protein interfacial energy. Analysis of these encounter complex models suggests that the recognition sites for specific DNA binding are usually favorable interaction sites for the nonspecific DNA probe and that nonspecific DNA–protein interaction modes exhibit some similarity to specific DNA–protein binding modes. Although the method requires as input the knowledge that the protein binds DNA, in benchmark tests it achieves better performance in identifying DNA-binding sites than three previously established methods, which are based on sophisticated machine-learning techniques. We further apply our method to protein structures predicted through modeling and demonstrate that it performs satisfactorily on protein models whose Cα root-mean-square deviation from the native structure is up to 5 Å. This study provides valuable structural insights into how a specific DNA-binding protein interacts with a nonspecific DNA sequence.
The similarity between the specific DNA–protein interaction mode and nonspecific interaction modes may reflect an important sampling step in the search by a DNA-binding protein for its specific DNA targets.

Scholarly Materials And Research @ Georgia Tech

CiteSeerX

Biophysical and electrochemical studies of protein-nucleic acid interactions

Author: A Abi
A Aravin
A Danhel
A Danhel
A Re
A Simonova
A Szabo
AA Gorodetsky
AA Travers
AL Feig
AM Cobb
AN Kawde
Andrew M. Cobb
B Deng
B Dey
B McConnell
BOS Scott
C Schildkraut
CA Hunter
CN N’Soukpoe-Kossi
CR Calladine
CS Yeh
D Rhodes
DB Dogini
DG Brown
DJ Richard
DK Hendrix
E Farjami
E Palecek
E Palecek
E Palecek
E Palecek
E Palecek
E Palecek
E Palecek
E Palecek
E Palecek
E Palecek
E Palecek
E Stejskalova
EB Jagelska
EB Jagelska
EJ Nam
EM Boon
EM Boon
EM Boon
FU Hartl
G Zauner
GP Yan
H Cahova
H Pivonkova
Hana Pivonkova
HF Wang
J Balintova
J Balintova
J Batra
J Coufal
J Labuda
J Majka
J Vacek
J Wang
JA McClellan
JD Puglisi
JL Wang
JL Wang
K Cahova-Kucharikova
K Nemcova
K Nemcova
K Teilum
KV Morris
L Havran
L Jen-Jacobson
LD Hansen
LL Pang
LM Hellman
Ludek Havran
M Bartosik
M Brazdova
M Brazdova
M Brenowitz
M Fojta
M Fojta
M Fojta
M Fojta
M Fojta
M Fojta
M Fojta
M Fojta
M Heyrovsky
M Hocek
M Kampmann
M Masarik
M Oda
M Willander
Miroslav Fojta
NB Leontis
NB Muren
NC Seeman
NN Salim
O Bell
P Zhang
PA Garrity
PC Weber
PH von Hippel
PH von Hippel
PJ Hagerman
R Christova
R Kaptein
RE Dickerson
Richard P. Bowater
RJ Falconer
RP Bowater
S Khezrian
S Lee
S Tan
SA Coulocheri
SE Halford
SR Holbrook
SR Rajski
T Hianik
TM Hall
TS Furey
V Brazda
V Ostatna
V Petri
V Ramakrishnan
V Vargova
W Filipowicz
WD Kohn
XB Yin
XX He
Y Xiao
Y Xiao
Y Zhang
YH Xu
Z Yang
ZS Wu
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 02/02/2015

This review is devoted to biophysical and electrochemical methods used for studying protein-nucleic acid (NA) interactions. The importance of NA structure and protein-NA recognition for essential cellular processes, such as replication or transcription, is discussed to provide background for the description of a range of biophysical chemistry methods that are applied to study a wide scope of protein-DNA and protein-RNA complexes. These techniques employ different detection principles with specific advantages and limitations and are often combined as mutually complementary approaches to provide a complete description of the interactions. Electrochemical methods have proven to be of great utility in such studies because they provide sensitive measurements and can be combined with other approaches that facilitate the protein-NA interactions. Recent applications of electrochemical methods in studies of protein-NA interactions are discussed in detail.

University of East Anglia digital repository

King's Research Portal

A reexamination of information theory-based methods for DNA-binding site identification

Author: A Kolb
AR Fernandez De Henestrosa
B Barash
CE Lawrence
CE Shannon
D Betel
D GuhaThakurta
DT Pride
EN Trifonov
ET Jaynes
ET Jaynes
G Robertson
G Thijs
GD Stormo
GD Stormo
GD Stormo
GE Crooks
GJ Phillips
GZ Hertz
I Erill
Ivan Erill
J Rudnick
J van Helden
JJ Kohler
JM Heumann
JT Kim
JW Gibbs
K Gaston
K Uchida
KL Griffith
L Kozobay-Avraham
LJ Sun
LL Gatlin
LL Gatlin
M Abella
M Asayama
M Butala
M Schnarr
MC O'Neill
MC O'Neill
MC O'Neill
MH Zweig
Michael C O'Neill
ML Bulyk
MS Gelfand
N Baichoo
O Aparicio
O Huisman
OG Berg
OG Berg
P D'Haeseleer
PH von Hippel
PH von Hippel
R Brent
R Jauregui
R Munch
R Munch
R Osada
R Staden
RJ Redfield
RK Shultzaberger
RK Shultzaberger
RK Shultzaberger
RV Parbhane
S Krishna
S Kullback
ST Cole
TD Schneider
TD Schneider
TD Schneider
TD Schneider
TD Schneider
TL Bailey
TL Bailey
X Liu
Z Chen
Z Xiaoyue
Publication venue: BioMed Central
Publication date: 01/02/2009

Abstract
Background: Searching for transcription factor binding sites in genome sequences is still an open problem in bioinformatics. Despite substantial progress, search methods based on information theory remain a standard in the field, even though the full validity of their underlying assumptions has only been tested in artificial settings. Here we use newly available data on transcription factors from different bacterial genomes to make a more thorough assessment of information theory-based search methods.
Results: Our results reveal that conventional benchmarking against artificial sequence data frequently leads to overestimation of search efficiency. In addition, we find that sequence information by itself is often inadequate and therefore must be complemented by other cues, such as curvature, in real genomes. Furthermore, results on skewed genomes show that methods integrating skew information, such as Relative Entropy, are not effective because their assumptions may not hold in real genomes. The evidence suggests that binding sites tend to evolve towards genomic skew, rather than against it, and to maintain their information content through increased conservation. Based on these results, we identify several misconceptions on information theory as applied to binding sites, such as negative entropy, and we propose a revised paradigm to explain the observed results.
Conclusion: We conclude that, among information theory-based methods, the most unassuming search methods perform, on average, better than any other alternatives, since heuristic corrections to these methods are prone to fail when working on real data. A reexamination of information content in binding sites reveals that information content is a compound measure of search and binding affinity requirements, a fact that has important repercussions for our understanding of binding site evolution.
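
As a rough illustration of the quantities this abstract refers to, the sketch below computes per-position information content for a set of aligned binding sites. It is not taken from the paper: the function name `information_content`, the example sites, and the background frequencies are assumptions made for illustration. With a uniform background the value reduces to 2 minus the Shannon entropy of the column (in bits); with a skewed background it becomes the relative entropy of the column against the genome composition. No small-sample correction is applied.

```python
import math

def information_content(sites, background=None):
    """Per-position information content (bits) of aligned DNA sites.

    sites: equal-length strings over A, C, G, T (hypothetical input).
    background: base -> genomic frequency; uniform if omitted (assumption).
    """
    alphabet = "ACGT"
    if background is None:
        background = {base: 0.25 for base in alphabet}
    length = len(sites[0])
    per_position = []
    for pos in range(length):
        column = [site[pos] for site in sites]
        total = 0.0
        for base in alphabet:
            freq = column.count(base) / len(column)
            if freq > 0:  # treat 0 * log(0) as 0
                total += freq * math.log2(freq / background[base])
        per_position.append(total)
    return per_position

# Hypothetical aligned sites, for illustration only.
example_sites = ["ACGT", "ACGA", "ACGT", "ACGC"]
example_ic = information_content(example_sites)
```

Summing the returned list gives the total information content of the motif under the assumption that positions contribute independently.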

Springer - Publisher Connector

Single-Molecule Analysis Reveals the Kinetics and Physiological Relevance of MutL-ssDNA Binding

Author: A Guarne
A Robertson
AM van Oijen
C Ban
C Ban
C Joo
C Joo
Changill Ban
CJ Fischer
Daekil In
EJ Sacho
G Luo
G Obmolova
I Rasnik
J Gorman
J Kosinski
JB Lee
JE Bronson
JJ Warren
Jong-Bong Lee
Jonghyun Park
JY Lee
K Drotschmann
LE Mechanic
MC Hall
MH Lamers
MJ Schofield
P Modrich
PH Von Hippel
R Dutta
R Roy
Richard Fishel
S Acharya
S Kim
S Myong
SM Bende
T Selmane
Vladimir N. Uversky
Yongmoon Jeon
Publication venue: Public Library of Science
Publication date: 01/01/2010

DNA binding by MutL homologs (MLH/PMS) during mismatch repair (MMR) has been considered based on biochemical and genetic studies. Bulk studies with MutL and its yeast homologs Mlh1-Pms1 have suggested an integral role for a single-stranded DNA (ssDNA) binding activity during MMR. We have developed single-molecule Förster resonance energy transfer (smFRET) and single-molecule DNA flow-extension assays to examine MutL interaction with ssDNA in real time. The smFRET assay allowed us to observe MutL-ssDNA association and dissociation. We determined that MutL-ssDNA binding required ATP and was greatest at ionic strengths below 25 mM (KD = 29 nM), while it decreased dramatically above 100 mM (KD > 2 µM). Single-molecule DNA flow-extension analysis suggests that multiple MutL proteins may bind ssDNA at low ionic strength but that this activity does not enhance stability at elevated ionic strengths. These studies are consistent with the conclusion that a stable MutL-ssDNA interaction is unlikely to occur at physiological salt, eliminating a number of MMR models. However, the activity may imply some related dynamic DNA transaction process during MMR.

CiteSeerX

포항공과대학교

Evolutionary tradeoffs in cellular composition across diverse bacteria

Author: A Moya
A Seybert
A Zaslaver
A-C Chien
AC Lloyd
AM Makarieva
B Luef
BJ Shuter
Christopher P Kempes
CP Kempes
D Tempest
D Tempest
F Fegatella
GA Mackie
GB West
H Huber
H Jakubowski
HN Schulz
I Golding
J DeLong
J Errington
Jan P Amend
JH Brown
JJ Turner
John Doyle
K Valgepea
KJ Niklas
L Dethlefsen
L Xu
Lawrence Wang
LR Comolli
M Scott
M Simon
N Lane
NW Goehring
P Lu
PH von Hippel
R Geider
R Milo
RL Burnap
S Cayley
T Maier
Tori Hoehler
Publication venue: Nature Publishing Group
Publication date: 05/04/2016

One of the most important classic and contemporary interests in biology is the connection between cellular composition and physiological function. Decades of research have allowed us to understand the detailed relationship between various cellular components and processes for individual species, and have uncovered common functionality across diverse species. However, there still remains the need for frameworks that can mechanistically predict the tradeoffs between cellular functions and elucidate and interpret average trends across species. Here we provide a comprehensive analysis of how cellular composition changes across the diversity of bacteria as connected with physiological function and metabolism, spanning five orders of magnitude in body size. We present an analysis of the trends with cell volume that covers shifts in genomic, protein, cellular envelope, RNA and ribosomal content. We show that trends in protein content are more complex than a simple proportionality with the overall genome size, and that the number of ribosomes is simply explained by cross-species shifts in biosynthesis requirements. Furthermore, we show that the largest and smallest bacteria are limited by physical space requirements. At the lower end of size, cell volume is dominated by DNA and protein content, the requirement for which predicts a lower limit on cell size that is in good agreement with the smallest observed bacteria. At the upper end of bacterial size, we have identified a point at which the number of ribosomes required for biosynthesis exceeds available cell volume. Between these limits we are able to discuss systematic and dramatic shifts in cellular composition. Much of our analysis is connected with the basic energetics of cells where we show that the scaling of metabolic rate is surprisingly superlinear with all cellular components.

Caltech Authors

Inferring Binding Energies from Selected Binding Sites

Author: A Sarai
AE Kel
C Tuerk
Christopher Workman
DA Gilchrist
David Granas
DS Fields
DSF Homsi
E Roulet
E Sharon
Gary D. Stormo
GD Stormo
GD Stormo
GD Stormo
GD Stormo
H Ji
HF Teh
HG Roider
J Linnell
J Liu
JB Kinney
JJ Moré
L van Oeffelen
M Djordjevic
M Djordjevic
MF Berger
ML Lee
MQ Zhang
O Berg
PH von Hippel
PV Benos
PV Benos
Q Zhou
R Staden
SJ Maerkl
TH Cormen
TK Blackwell
TK Man
U Gerland
V Mustonen
VH Nagaraj
WE Wright
X Liu
X Meng
Y Takeda
Yue Zhao
Publication venue: Public Library of Science
Publication date: 01/01/2009

We employ a biophysical model that accounts for the non-linear relationship between binding energy and the statistics of selected binding sites. The model includes the chemical potential of the transcription factor, non-specific binding affinity of the protein for DNA, as well as sequence-specific parameters that may include non-independent contributions of bases to the interaction. We obtain maximum likelihood estimates for all of the parameters and compare the results to standard probabilistic methods of parameter estimation. On simulated data, where the true energy model is known and samples are generated with a variety of parameter values, we show that our method returns much more accurate estimates of the true parameters and much better predictions of the selected binding site distributions. We also introduce a new high-throughput SELEX (HT-SELEX) procedure to determine the binding specificity of a transcription factor in which the initial randomized library and the selected sites are sequenced with next-generation methods that return hundreds of thousands of sites. We show that after a single round of selection our method can estimate binding parameters that give very good fits to the selected site distributions, much better than standard motif identification algorithms.
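
The non-linear relationship between binding energy and site selection mentioned above is often written as a two-state equilibrium occupancy. The sketch below shows that common textbook form; it is an assumption made for illustration, not necessarily the exact model used in the paper. The function name `binding_probability` and its parameters are hypothetical: `energy` is the binding energy of a site, `mu` is the chemical potential set by the transcription factor concentration, and both are expressed in units of kT.

```python
import math

def binding_probability(energy, mu, kT=1.0):
    """Equilibrium probability that a site is bound (two-state model).

    energy: binding energy of the site (lower means stronger binding).
    mu: chemical potential of the transcription factor.
    kT: thermal energy, in the same units as energy and mu.
    """
    return 1.0 / (1.0 + math.exp((energy - mu) / kT))

# Sites well below mu are nearly saturated, sites well above are rarely
# bound, so selection frequency is not a simple exponential in energy.
strong_site = binding_probability(-2.0, 0.0)
weak_site = binding_probability(2.0, 0.0)
```

When the energy is far above `mu`, the expression approaches the Boltzmann factor exp(-(energy - mu)/kT), which is the regime in which purely probabilistic (log-odds) motif models are a reasonable approximation.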

University of Essex Research Repository

Digital Commons@Becker

The Influence of Transcription Factor Competition on the Relationship between Occupancy and Affinity

Author: A Marcovitz
Boris Adryan
CC Fowlkes
D Chu
DT Gillespie
DT Gillespie
DT Gillespie
E Segal
Frances M. Sladek
GD Stormo
GD Stormo
GD Stormo
GK Ackers
H Flyvbjerg
HG Roider
J Elf
J Zeitlinger
JS van Zon
L Bintu
L Bintu
L Mirny
M Djordjevic
M Hedglin
M Kampmann
M Riley
M Santillan
MD Biggin
N Rosenfeld
Nicolae Radu Zabet
NR Zabet
NR Zabet
NR Zabet
NR Zabet
OG Berg
OG Berg
P Hammar
PH von Hippel
R Hermsen
Robert Foy
S Thomas
SJ Maerkl
T Kaplan
T Raveh-Sadka
T Wasson
U Gerland
Y Zhao
Z Wunderlich
Publication venue: 'Public Library of Science (PLoS)'
Publication date: 27/03/2013

Transcription factors (TFs) are proteins that bind to specific sites on the DNA and regulate gene activity. Identifying where TF molecules bind and how much time they spend on their target sites is key to understanding transcriptional regulation. It is usually assumed that the free energy of binding of a TF to the DNA (the affinity of the site) is highly correlated with the amount of time the TF remains bound (the occupancy of the site). However, knowing the binding energy is not sufficient to infer actual binding site occupancy. This mismatch between the occupancy predicted by the affinity and the observed occupancy may be caused by various factors, such as TF abundance, competition between TFs or the arrangement of the sites on the DNA. We investigated the relationship between the affinity of a TF for a set of binding sites and their occupancy. In particular, we considered the case of the transcription factor lac repressor (lacI) in E. coli, and performed stochastic simulations of the TF dynamics on the DNA for various combinations of lacI abundance and competing TFs that contribute to macromolecular crowding. We also investigated the relationship between site occupancy and the information content of position weight matrices (PWMs) used to represent binding sites. Our results showed that for medium and high affinity sites, TF competition does not play a significant role for genomic occupancy except in cases when the abundance of the TF is significantly increased, or when the PWM displays relatively low information content. Nevertheless, for medium and low affinity sites, an increase in TF abundance (for both cognate and non-cognate molecules) leads to an increase in occupancy at several sites. © 2013 Zabet et al.

arXiv.org e-Print Archive

Queen Mary Research Online

FigShare

Accurate Prediction of Inducible Transcription Factor Binding Intensities In Vivo

Author: Adam Siepel
AG Robertson
André L. Martins
AP Boyle
B Tursun
DA Gilchrist
E Sharon
G Hu
H Lee
H Li
H Sakurai
H Tao
HH He
J Liu
JH Friedman
John T. Lis
JR Hesselberth
L Narlikar
M Fritsch
MF Berger
Michael J. Guertin
Michael Snyder
MJ Guertin
MJ Guertin
N Hayashida
PH von Hippel
PV Kharchenko
R Chen
R Gordan
R Pique-Regi
S John
S John
S Lin
S Pepke
SC Biddie
SE Gonsalves
T Barrett
T Kaplan
TC Voss
TL Bailey
W Wu
X He
X Liu
XY Li
Y Enoki
Y Field
Y Zhang
Publication venue: Public Library of Science
Publication date: 01/01/2012

DNA sequence and local chromatin landscape act jointly to determine transcription factor (TF) binding intensity profiles. To disentangle these influences, we developed an experimental approach, called protein/DNA binding followed by high-throughput sequencing (PB–seq), that allows the binding energy landscape to be characterized genome-wide in the absence of chromatin. We applied our methods to the Drosophila Heat Shock Factor (HSF), which inducibly binds a target DNA sequence element (HSE) following heat shock stress. PB–seq involves incubating sheared naked genomic DNA with recombinant HSF, partitioning the HSF–bound and HSF–free DNA, and then detecting HSF–bound DNA by high-throughput sequencing. We compared PB–seq binding profiles with ones observed in vivo by ChIP–seq and developed statistical models to predict the observed departures from idealized binding patterns based on covariates describing the local chromatin environment. We found that DNase I hypersensitivity and tetra-acetylation of H4 were the most influential covariates in predicting changes in HSF binding affinity. We also investigated the extent to which DNA accessibility, as measured by digital DNase I footprinting data, could be predicted from MNase–seq data and the ChIP–chip profiles for many histone modifications and TFs, and found GAGA element associated factor (GAF), tetra-acetylation of H4, and H4K16 acetylation to be the most predictive covariates. Lastly, we generated an unbiased model of HSF binding sequences, which revealed distinct biophysical properties of the HSF/HSE interaction and a previously unrecognized substructure within the HSE. These findings provide new insights into the interplay between the genomic sequence and the chromatin landscape in determining transcription factor binding intensity.

CiteSeerX

Cold Spring Harbor Laboratory Institutional Repository

FigShare

E. coli metabolic protein aldehyde-alcohol dehydrogenase-E binds to the ribosome: a unique moonlighting action revealed

Author: A Apirakaramwong
A Basle
A Des Georges
A Favre
A Korostelev
A Kucukelbir
A Sandikci
AS Spirin
AV Zavialov
B Das
BKAF Rath
BS Schuwirth
CJ Jeffery
D Huber
D Kessler
DE Brodersen
DN Wilson
E Crooke
E Martinez-Hackert
E Oh
E Villa
F Merz
G Butland
H Nikaido
J Frank
J Knappe
J LeBarron
J Membrillo-Hernandez
J Sengupta
JR Hillebrecht
K Chen
KH Nierhaus
L Ferbitz
LA Kelley
LG Trabuco
M Gerstein
M Jiang
M Laughrea
M Selmer
N Ghosh
NA Baker
P Echave
PaN Traub
PH von Hippel
PV Sergiev
Q Guo
S Takyar
SS Chen
T Saio
T Schirmer
T Wagenknecht
TM Schmeing
TR Shaikh
TR Shaikh
TR Sundermeier
V Chandran
V Samuel Raj
X Qu
Y Hashem
Z Shajani
Publication venue: 'Springer Science and Business Media LLC'
Publication date: 01/01/2016

It is becoming increasingly evident that a high degree of regulation is involved in the protein synthesis machinery, entailing more interacting regulatory factors. A multitude of proteins have been identified recently which show regulatory function upon binding to the ribosome. Here, we identify tight association of a metabolic protein, aldehyde-alcohol dehydrogenase E (AdhE), with the E. coli 70S ribosome isolated from cell extract under low salt wash conditions. Cryo-EM reconstruction of the ribosome sample allows us to localize its position on the head of the small subunit, near the mRNA entrance. Our study demonstrates substantial RNA unwinding activity of AdhE, which can account for the ability of the ribosome to translate through at least certain downstream mRNA helices. Thus far, in E. coli, no ribosome-associated factor has been identified that shows downstream mRNA helicase activity. Additionally, the cryo-EM map reveals interaction of another extracellular protein, outer membrane protein C (OmpC), with the ribosome at the peripheral solvent side of the 50S subunit. Our results also provide important insight into a plausible functional role of OmpC upon ribosome binding. Visualization of the ribosome purified directly from the cell lysate unveils for the first time interactions of additional regulatory proteins with the ribosome.